Re-mining Topics Popular in the Recent Past from a Large-Scale Closed Caption TV Corpus
نویسندگان
چکیده
منابع مشابه
"Draw My Topics": Find Desired Topics fast from large scale of Corpus
We develop the “Draw My-Topics” Toolkit, which provides a fast way to incorporate social scientists’ concerns and interests into the standard topic model. Instead of using raw corpus with primitive processing as input, an algorithm based on Vector Space Model and Conditional Entropy are used to connect social scientists’ subjective want and the unsupervised topic models’ output. Space for users...
متن کاملLarge Scale Corpus Analysis and Recent Applications
Recent progress of corpus and machine learning-based natural language processing methodologies have made it possible to handle large scale corpus with a quite high accuracy. The speaker is now involved in a project for constructing a large scale contemporary Japanese balanced corpus, aiming at constructing automatic annotation tools on various levels of natural language analyses. I will first i...
متن کاملMining Large-scale TV Group Viewing Patterns for Group Recommendation
We present a large-scale study of television viewing habits, focusing on how individuals adapt their preferences when consuming content in group settings. While there has been a great deal of recent work on modeling individual preferences , there has been considerably less work studying the behavior and preferences of groups, due mostly to the difficulty of data collection in these settings. In...
متن کاملProject for Production of Closed-Caption TV Programs for the Hearing Impaired
We describe an on-going project whose primary aim is to establish the technology of producing closed captions for TV news programs efficiently using natural language processing and speech recognition techniques for the benefit of the hearing impaired in Japan. The project is supported by the Telecommunications Advancement Organisation of Japan with the help of the ministry of Posts and Telecomm...
متن کاملSTAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset
In recent years, automatic generation of image descriptions (captions), that is, image captioning, has attracted a great deal of attention. In this paper, we particularly consider generating Japanese captions for images. Since most available caption datasets have been constructed for English language, there are few datasets for Japanese. To tackle this problem, we construct a large-scale Japane...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Future Computer and Communication
سال: 2015
ISSN: 2010-3751
DOI: 10.7763/ijfcc.2015.v4.364